AMENDMENTS TO THE CLAIMS: 



Please amend the claims as follows: 

1. (Currently amended) A computer-implemented process for generatin g cr e ating a filter 
for selecting documents, comprising: 

identifying a plurality of profiles fi'om a set of training documents : 
creating a sub-filter for each of said plurality of profiles, each of said sub-filters having 
an input and an output^ 

connecting each of said inputs at a single node; and 

combining each of said outputs to thereby form said filter for selecting documents; and 
storing said filter for selecting documents in a computer readable medium, said filter for 
selecting documents being accessible by computer readable program code for filtering and 
selecting documents: 

wherein creating each sub-filter comprises the steps of 

selecting a first subset and a second subset from a set of training documents, 

extracting a set of features from the first subset, and 

calculating a score threshold for the set of features from the second subset. 

2. (Currently amended) A computer-implemented process, as in claim 1, further 
comprising: 

mutipl e xin gm ultiplexing said outputs of said sub-filters to create a first filter output. 

3. (Currently amended) A computer-implemented process, as in claim 2, fiirther 
comprising: 

combining said first filter output with at least one sub-filter output to create at least one 
second filter output. 

4. (Currently amended) A computer-implemented process, as in claim 3, fiirther 
comprising: 

aggregating said first filter output and said second filter output using a ftinction. 
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5. (Currently amended) A computer-implemented process, as in claim 4, wherein said 
function is a linear function. 

6. (Currently amended) A computer-implemented process, as in claim 4, wherein said 
function is a weighted function. 

7. (Currently amended) A computer-implemented process, as in claim 4, wherein said 
function is a threshold e dt hreshold aggregation function. 

8. (Currently amended) A computer-implemented process for generatin g cr e ating an 
ensemble filter for selecting documents from a document source str e am of documents, 
comprising: 

identifying a plurality of sets of documents from said str e am of docum e nts a set of 
training documents ; 

identifying a plurality of profiles corresponding to said plurality of sets of documents; 
creating a plurality of sub-filters using for said plurality of profiles; aftd 
combining a first one of said plurality of sub-filters with a second one of said plurality of 
sub-filters to thereby create an ensemble filter ; and 

storing said ensemble filter for selecting documents in a computer readable medium, said 
filter for selecting documents being accessible by computer readable program code for filtering 
and selecting documents; 

wherein creating each sub-filter comprises the steps of 

selecting a first subset and a second subset from a set of documents. 

extracting a set of features from the first subset, and 

calculating a score threshold for the set of features from the second subset. 

9-15 (Canceled) 

16. (New) A computer-implemented process, as in claim 8, wherein the document 
source is a database of documents. 
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1 7. (New) A computer-implemented process, as in claim 8, wherein the document 
source is a stream of documents. 

1 8. (New) A computer-implemented process for generating a filtering profile for 
selecting documents comprising the steps of: 

selecting a first subset of training documents and a second subset of training 
documents from a set of training documents; 

creating a filtering profile having an input and an output by extracting a set of 
features from the first subset of training documents; 

creating a score threshold for the filtering profile by calculating an optimal score 
threshold for the set of features from the second subset of training documents; and 

storing said filtering profile for selecting documents in a computer readable 
medium, said filtering profile for selecting documents being accessible by computer readable 
program code for filtering and selecting documents. 

19. (New) The computer-implemented process of claim 1 8 wherein extracting a set of 
features further comprises identifying features in the first subset using a term fi-equency inverse 
document frequency ("TF-IDF") score. 

20. (New) The computer-implemented process of claim 1 8 wherein extracting a set of 
features further comprises performing a Rocchio algorithm. 

2 1 . (New) The computer-implemented process of claim 1 8 wherein calculating an 
optimal score threshold further comprises performing a beta-gamma thresholding algorithm. 

22. (New) An apparatus for generating a document filter, comprising: 
a processing system; and 

a memory coupled to the processing system, wherein the processor is configured 

to: 

select a first subset of training documents and a second subset of training 
documents from a set of training documents; 
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create a filtering profile having an input and an output by extracting a set of 
features firom the first subset of training documents; 

create a score threshold for the filtering profile by calculating an optimal score 
threshold for the set of features from the second subset of training documents; and 

store said filtering profile for selecting documents in a computer readable 
medium, said filtering profile for selecting documents being accessible by computer readable 
program code for filtering and selecting documents. 

23. (New) An article of manufacture comprising a computer readable medium having 
executable program code embodied therein for generating a document filter, wherein the 
executable program code is adapted to cause the processing system to: 

select a first subset of training documents and a second subset of training 
documents from a set of training documents; 

create a filtering profile having an input and an output by extracting a set of 
features fi^om the first subset of training documents; 

create a score threshold for the filtering profile by calculating an optimal score 
threshold for the set of features fi'om the second subset of training documents; and 

store said filtering profile for selecting documents in a computer readable 
medium, said filtering profile for selecting documents being accessible by computer readable 
program code for filtering and selecting documents. 
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